Big Question: Which state will be the best choice for Amy to migrate?
Interactive: https://chenkaiyao2000.shinyapps.io/shiny/
In most cases, we cannot decide where we are born and live at a younger age, but when we grow up, we get the chance to decide where we want to live by migrating to other states, even other countries. In the first dataset about migration decisions, we specifically look at the place the participants live at 16 years old and the place they live when they are 26 years old. Because before the age of 16 years old, people mainly stay in the states they were originally born in or where their family is. However, during their age from 16 years old to 26 years old, they get the chance to migrate because of the university’s location and working location.
Among my friends, the location is one of the factors when they choose the graduate school and working location. The safety level( especially crime rate) of the state is one of the primary things in consideration. Therefore, we choose the dataset about migration trend and crime rate to explore the relationship in between.
Therefore, let us introduce Amy and help her make the migration decision throughout our exploring process!
Amy is a 20 years old girl studying advertising in Boston University. As a rising sophomore, it is the time to start thinking about graduate school’s application list and jobs. She has a strong desire to leave Boston, where she originally came from. Therefore, location is definitely one key factor in consideration.
Our unifying hypothesis is : The quality of life standard in each state will have an impact on people’s migration decision from 16 years old to 26 years old.
The first dataset we focus on the population with two categorical variables: family income level and race. Therefore, as a starter of the process, we want to get some basic insights about how our sample population looks demographically.
In graph 1, the x-axis is the income level which has been divided into five quantiles: Q1 represents the lowest family income group and Q5 represents the highest. The y-axis represents the percentage of population for different races in each income group. We use the percentage instead of the actual population number since the total population in each group is different and the percentage makes the comparison clearer.
As we can see in the graph, in the Q1 family income quantile group, the population for white group and the black group is relatively similar. Besides that, from Q2 to Q5, as the family income level increases, the white population starts to dominate the percentage while other race groups’ population diminish. For graph 2 ( on the right), axes are switched. The x-axis is the race group, the y axis represents the percentage of each income group. Then we look carefully into each column: For Asian race group and the white race group, the population percentage in each five income quantile are almost equally distributed. The Q1 income quantile takes the least percentage and Q5 quantile takes the largest percentage among the five. For the other three race group ( Black, Hispanic & Other), they are more dominant in the Q1 and Q2 income level group.
Therefore, we assume Amy belongs to the white race group and Q3 income level.
After in order to get the basic understanding of what migration trends look like, we start with a small subset which specifically focuses on migration trends within Massachusetts. The reason we chose Massachusetts is because all the group members are students in Boston University. After staying here for four years, some of us have strong intentions to leave Boston while some of us love the city and want to stay. Therefore, Massachusetts is the most relatable choice and provides some insights for people that tend to migrate after graduation.
We came up with our first question: How do people migrate to other commuter zones within Massachusetts at the age of 26?
Our hypothesis is that Boston will have the largest migration population.
We came up with the graph above. In this graph, the x-axis represents the population in that commuter zone from other commuter zones in Massachusetts at the age of 26. The y-axis represents their destination commuter zones.
We start the analysis with a broader picture: Apparently, Boston has the largest population among five computer zones. The second largest population is springfield. The commuter zone with the least population within the dataset is Nantucket. Then, we dive deeper to look at the construction of the migration population. We exclude the population that stays in its original commuter zone from age 16 to 26. Most of the moving population in Springfield is from Boston. Additionally, Springfield has the second largest migration population and Boston has the largest migration population. Moreover, Boston has the largest population for both migrate in and migrate out.
Therefore, if Amy decides to move within Massachusetts, Springfield might be her first choice since it is a hot pick and very near to Boston.
After looking specifically at Massachusetts’ migration trend, we shift our focus to the broader picture: The migration trend among all states.
According to the graph above, the top five migration states have very distinct results compared to all other states. They are : New York, California, New Hampshire, Rhode Island and Florida. We also noticed that the white group population that migrates into the top five destinations are even larger than the total move in migration for the remaining states.
Based on the demographic picture for Amy, we think if Amy likes warm weather, California and Florida can both be a good option for her to move. If Amy likes to stay in the east-coast, New York is always an option. If Amy doesn’t want very crowded city life, New Hampshire, Connecticut and DC can also be put into the migration list.
After knowing the TOP five migration destinations generally, we started to analyze what features are affecting people’s migration decision from age 16 to age 26.
First, we look at the number of colleges in each state. People between the ages of 16- 26 are exactly the period of study for schools. By knowing the number of colleges in each state, we will be able to find the relation between how the number of c.
The x-axis represents the name of each state in the US. The y-axis represents the total number of colleges in the state. Based on the graph above, we can see that California(CA) contains the largest number of colleges among all states. New York State has the second highest number. More generally, the Top 5 are: CA, NY, TX, FL, PA.
Then, we zoom in to look at the number of each type of college among every state. Since the numbers of Fine Arts School(Arts), Business Secretarial School(Business), Computer Training(Computer), Educational Support Services(Eduserv), and Flight Training(Flights) are small, we decided to only focus on the categories with conspicuous result:
California has the most schools in General, Junior, and Other types of schools. Among New York State, the General types of schools are the largest number with 228 schools. For Taxes, it has similar numbers of schools in Cosme_Barber, General, and Junior types.
Therefore, if Amy wants to go to a graduate school, California seems to be her best choice.
In conclusion, we pick several options for Amy to choose after graduating from Boston University. The Top 3 option is : California, New York and Florida.